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Summary 

The basic helix-loop-helix factor Myod initiates skeletal 
muscle differentiation by directly and sequentially activating 
sets of muscle differentiation genes, including those encoding 
muscle contractile proteins. We hypothesize that Pbx 
homeodomain proteins direct Myod to a subset of its 
transcriptional targets, in particular fast-twitch muscle 
differentiation genes, thereby regulating the competence of 
muscle precursor cells to differentiate. We have previously 
shown that Pbx proteins bind with Myod on the promoter of 
the zebrafish fast muscle gene mylpfa and that Pbx proteins 
are required for Myod to activate mylpfa expression and the 
fast-twitch muscle-specific differentiation program in 
zebrafish embryos. Here we have investigated the 
interactions of Pbx with another muscle fiber-type 
regulator, Prdmla, a SET-domain DNA-binding factor that 
directly represses mylpfa expression and fast muscle 
differentiation. The prdmla mutant phenotype, early and 
increased fast muscle differentiation, is the opposite of the 
Pbx-null phenotype, delayed and reduced fast muscle 
differentiation. To determine whether Pbx and Prdmla 
have opposing activities on a common set of genes, we used 



RNA-seq analysis to globally assess gene expression in 
zebrafish embryos with single- and double-losses-of-function 
for Pbx and Prdmla. We find that the levels of expression of 
certain fast muscle genes are increased or approximately wild 
type in pbx2/4-MO\prdmla—/— embryos, suggesting that Pbx 
activity normally counters the repressive action of Prdmla 
for a subset of the fast muscle program. However, other fast 
muscle genes require Pbx but are not regulated by Prdmla. 
Thus, our findings reveal that subsets of the fast muscle 
program are differentially regulated by Pbx and Prdmla. 
Our findings provide an example of how Pbx homeodomain 
proteins act in a balance with other transcription factors to 
regulate subsets of a cellular differentiation program. 

© 2013. Published by The Company of Biologists Ltd. This is 
an Open Access article distributed under the terms of the 
Creative Commons Attribution Non-Commercial Share Alike 
License (http://creativecommons.Org/licenses/by-nc-sa/3.0). 
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Introduction 

The differentiation of muscle precursor cells into contractile 
skeletal muscle fibers is necessary for normal muscle 
development and regeneration. Vertebrate skeletal muscles 
consist of a range of fiber types, from slow-twitch fibers 
needed for low-intensity, sustained aerobic activity, to fast-twitch 
fibers suited for short bursts of intense, anaerobic activity 
(Wigmore and Evans, 2002). Muscle fiber types are operationally 
defined by the expression of myosin heavy chain (MyHC) genes 
(Schiaffmo and Reggiani, 1996; Pette and Staron, 2000). In 
mammals, 1 1 MyHC genes contribute to skeletal muscle fiber 
heterogeneity and functional flexibility (Schiaffmo and Reggiani, 
201 1). However, muscle fiber types are distinguished not just by 
MyHC expression but by the expression of other sarcomeric 
proteins, most of which also exist as multiple isoforms, as well as 
by isoforms of intracellular calcium signaling components and 
levels of metabolic enzymes (Schiaffmo and Reggiani, 1996; 



Pette and Staron, 2000; Schiaffmo and Reggiani, 2011). Thus, it 
is the coordinated expression of fiber-type-specific gene 
expression programs that gives each fiber type its unique 
contractile and metabolic properties. 

Ultimately, the expression of skeletal muscle differentiation 
genes, including those encoding the MyHCs and other sarcomeric 
proteins, is controlled by the family of myogenic basic helix- 
loop-helix (bHLH) transcription factors: Myf5, Myod, Myf6 and 
Myog (Buckingham, 2001; Tapscott, 2005). In particular, Myod 
can directly bind to the promoters and activate the expression of 
both slow and fast MyHCs (Cao et al., 2010). How do factors like 
Myod activate their transcriptional targets in a temporally and 
spatially controlled manner to drive cell-type-specific, or fiber- 
type-specific, differentiation? Our work has shown that Pbx 
TALE-class homeodomain proteins function as cofactors for 
Myod and regulate a subset of Myod target genes, in particular to 
promote fast-muscle-specific differentiation (Berkes et al., 2004; 
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Maves et al., 2007). We have shown that Pbx proteins bind with 
Myod on the promoters of a subset of Myod target genes, 
including Myog and the fast muscle gene mylpfa, and that Pbx 
proteins are required for Myod to activate myog and mylpfa 
expression and the fast-muscle-specific differentiation program 
in zebrafish embryos (Berkes et al., 2004; Maves et al., 2007). 
Our specific hypothesis is that Pbx homeodomain proteins 
function as pioneer transcription factors to direct Myod to a 
subset of its transcriptional targets, in particular fast muscle 
differentiation genes, thereby regulating the competence of 
muscle precursor cells to differentiate into a specific fiber type. 

Pioneer factors are transcription factors that have the ability to 
access and bind DNA in repressive chromatin and that bind prior 
to gene activation and prior to the presence of other transcription 
factors (Zaret and Carroll, 201 1). Pbx proteins have recently been 
shown to function as pioneer factors in breast cancer cells, where 
PBX1 binds prior to, and is required for, Estrogen Receptor 
binding (Magnani et al., 2011). Supporting the role of Pbx 
proteins as pioneer factors in skeletal muscle differentiation, Pbx 
proteins can bind silent Myod target gene promoters prior to 
Myod binding and muscle differentiation (Berkes et al., 2004). 
Pioneer factors can function in passive roles, in which their 
presence reduces the number of additional transcription factor 
binding events needed, or in more active roles, in which they 
facilitate chromatin opening that is necessary before other factors 
can bind (Zaret and Carroll, 2011). Whether Pbx proteins act in 
passive or active roles as pioneer factors is not known. 

To address how Pbx proteins function in muscle fiber-type 
differentiation, here we use zebrafish to examine how Pbx 
interacts with another fiber-type regulator, Prdmla. In zebrafish, 
slow-twitch fibers derive from the adaxial cells, the most medial 
paraxial mesoderm cells, whereas more lateral somitic cells will 
give rise to fast-twitch fibers (Devoto et al., 1996). Adaxial cells 
express the transcriptional repressor Prdmla/Blimpl, which is 
both necessary and sufficient for slow MyHC expression and 
specification of slow twitch fiber identity (Baxendale et al., 
2004). In zebrafish, many fast muscle genes are initially 
transiently co-expressed with slow muscle genes in the adaxial 
cells before differentiation is completed (Bryson Richardson et 
al., 2005; Burguiere et al., 2011). In prdmla mutant zebrafish 
embryos, the expression of smyhcl and other slow muscle genes 
is reduced in the adaxial cells, while fast muscle genes show 
increased and premature expression in the adaxial cells (Roy et 
al., 2001; Baxendale et al., 2004; von Hofsten et al., 2008; Liew 
et al., 2008). Pbx and Prdmla thus have opposite requirements 
for fast muscle differentiation in zebrafish. Here we combine loss 
of function of both Pbx proteins and Prdmla to test the 
requirements for Pbx in promoting fast muscle differentiation. 
We use RNA-seq to identify subsets of the fast-muscle program 
that are either co-regulated by Pbx and Prdml or independently 
regulated by these factors. We find that in embryos lacking both 
Pbx and Prdmla, the levels of expression of many fast muscle 
genes are approximately wild type or are increased relative to 
wild type, suggesting that, for a subset of the fast muscle 
program, a major role for Pbx is to counter the repressive action 
of Prdmla. However, certain fast muscle genes require Pbx but 
are not regulated by Prdmla. Thus, our findings reveal that 
subsets of the fast muscle program are differentially regulated by 
Pbx and Prdmla. Our findings provide an example of how Pbx 
pioneer factors act in a balance with other transcriptional 
regulators to direct a specific cellular differentiation pathway. 



Results 

prdmla is epistatic to pbx2/4 for fast muscle gene expression in 
adaxial cells 

One advantage of using zebrafish to study Pbx functions in early 
muscle differentiation is that we have the ability to remove all Pbx 
function by using morpholinos to knock down maternal and 
zygotic expression of both pbx2 and pbx4 (pbx2/4), the two pbx 
genes expressed in young zebrafish embryos (Waskiewicz et al., 
2002; Maves et al., 2007). pbx2/4 and prdmla have opposite 
requirements for fast muscle gene expression in zebrafish: in the 
absence of Pbx2/4, fast muscle gene expression in both adaxial 
cells and lateral cells is reduced and delayed, whereas in the 
absence of Prdmla, fast muscle gene expression occurs 
prematurely and at increased levels in adaxial cells (Maves et 
al., 2007; Roy et al, 2001; Baxendale et al, 2004; von Hofsten et 
al., 2008; Liew et al., 2008). We wanted to combine loss of 
function of both pbx2/4 and prdmla to test the requirements for 
Pbx in promoting fast muscle gene expression. We reasoned that if 
Pbx proteins, as pioneer factors, play a necessary role in 
assembling factors to drive fast-muscle gene expression, then 
expression of fast muscle genes should be reduced and delayed in 
pbx2/4-MO;prdmla— /— embryos, as in pbx2/4-MO embryos. 
Alternatively, if, in the adaxial cells, Pbx proteins are primarily 
needed to overcome the repressive actions of Prdmla, then 
expression of fast muscle genes should appear at increased levels 
in pbx2/4-MO;prdmla—/— embryos, as in prdmla—/— embryos. 
We first confirmed that pbx2/4 and prdmla are not required for 
each other's expression (Fig. 1). We then examined the expression 
of three target genes in pbx2/4-MO;prdmla—/— double loss-of- 
function embryos: mylpfa, myhzl.3, and myog (Fig. 2). mylpfa and 
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Fig. I. pbx2/4 and prdmla are not required for each other's expression. 

(A,B) RNA in situ expression of prdmla and krox-20 in (A) control and 
(B) pbx2/4-MO embryos, prdmla expression in the adaxial cells along the axis of 
the embryo is maintained, while krox-20 expression is lost, in pbx2/4-MO 
embryos. Arrowheads point to adaxial cells, and r3 and r5 indicate krox-20 
expression in hindbrain rhombomeres. (C,D) Anti-Pbx (red) and DAPI (blue) 
staining, along with merged images, in (C) control and (D) prdmla—/— embryos. 
Pbx expression is maintained in prdmla—/— embryos. All embryos are shown in 
dorsal view, anterior towards the left. Scale bars: (A) 50 |rm, (C) 10 urn. 
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Fig. 2. Expression of Pbx and Prdmla target genes in pbx2/4-MO;prdmla—/— embryos. (A,C,E) RNA in situ expression of (A) mylpfa and krox-20, (C) myhzl.3 
and krox-20, and (E) myog and krox-20 in green, stnyhcl in red, and merged images in control, pbx2/4-MO, prdmla—/—, and pbx2/4-MO;prdmla—/— embryos. 
krox-20 expression is included as a control for the pbx2/4-MO knockdown (Maves et al., 2007). r3 and r5 indicate krox-20 expression in hindbrain rhombomeres. 
smyhcl is downregulated in prdmla—/— embryos (Elworthy et al., 2008; von Hofsten et al., 2008). (A,C) embryos are at 14 somites, (E) embryos are at 10 somites. 
All embryos are shown in dorsal view, anterior towards the left. Somites were counted in all embryos shown to confirm consistent staging. (B,D,F) Graphs of 
real-time RT-PCR (qRT-PCR) expression levels for (B) mylpfa, (D) myhzl.3, and (F) myog. Expression levels were normalized to odd expression. Error bars 
represent standard deviation. Somite stages (14 s, 10 s) are shown. Scale bar: 50 \un. 



myhzl.3 encode fast muscle sarcomere proteins whose expression 
is positively, and, in the case of mylpfa, directly, regulated by 
Pbx2/4 and directly repressed by Prdmla (Fig. 2A-D) (Maves et 
al, 2007; von Hofsten et al., 2008; Liew et al., 2008). myog, which 
encodes a muscle regulatory transcription factor, is directly and 
positively regulated by Pbx and shows upregulation in the adaxial 
cells of prdmla—/— embryos, indicating that myog is normally 
repressed by Prdmla (Fig. 2E,F) (Maves et al., 2007; Maves et al., 
2009). In pbx2/4-MO;prdmla— /— embryos, we find upregulation 
of mylpfa, myhzl.3, and myog in the adaxial cells relative to the 
controls (Fig. 2A,C,E), resembling the phenotype of prdmla—/— 
embryos. Using real-time PCR (qRT-PCR) to quantitatively 
examine these markers, we find that mylpfa and myhzl.3 
expression are strongly upregulated in pbx2/4-MO;prdmla—/— 
embryos relative to both control and pbx2/4-MO embryos, 
although not quite to the levels seen in prdmla—/— embryos 
(Fig. 2B,D). For myog, we find that the level of expression in pbx2/ 
4-MO;prdmla—/— embryos is close to the control level (Fig. 2F), 
and, based on the in situ pattern, seems to represent an upregulation 
of myog in the adaxial cells of pbx2/4-MO;prdmla—/— embryos 
but absence of the more lateral Pbx-dependent expression 
(Fig. 2E). These results show that expression of mylpfa, 
myhzl.3, and myog within the prrfm/a-expressing adaxial cells is 
upregulated in pbx2/4-MO;prdmla—/— embryos, resembling the 
prdmla—/— phenotype. 

To more thoroughly examine the Pbx- and Prdm la-regulated 
expression of mylpfa, we examined higher magnification 
confocal images. In control embryos at 14s, mylpfa is 
expressed in both lateral cells and in adaxial cells, where its 



expression coincides with the slow muscle gene smyhcl 
(Fig. 3 A) (Burguiere et al., 2011). In pbx2/4-MO embryos, 
both adaxial and lateral expression domains are lost (Fig. 3B), 
showing that Pbx is needed to promote mylpfa expression in both 
adaxial and lateral cells. In prdmla—/— embryos, adaxial and 
lateral expression of mylpfa are present (Fig. 3C). Critically, in 
pbx2/4-MQ;prdmla—/— embryos, adaxial expression of mylpfa 
is maintained, as in prdml—/— embryos, whereas the lateral 
domain is reduced or absent as in pbx2/4-MO embryos (Fig. 3D). 
Taken together, the results in Figs 2 and 3 show that prdmla is 
epistatic to pbx2/4 for fast muscle gene expression in adaxial 
cells, revealing that pbx2/4 functions as a genetic suppressor of 
prdmla. 

RNA-seq approach to identify Pbx2/4- and Prdml a-regulated 
genes: comparison with previous expression array studies 

We next wanted to ask whether prdmla functions epistatically to 
pbx2/4 broadly across the fast-muscle gene expression program 
or only at subsets of fast muscle genes. We confirmed that Pbx2/4 
and Prdmla could independently regulate known dependent 
genes by showing that krox-20 is strongly downregulated in pbx2/ 
4-MO and pbx2/4-MO;prdmla—/—, but not in prdmla—/—, 
embryos, and that smyhcl is strongly downregulated in 
prdmla—/— and pbx2/4-MO;prdmla— /— , but not in pbx2/4- 
MO, embryos (Fig. 2A, Fig. 4) (Waskiewicz et al., 2002; 
Elworthy et al., 2008; von Hofsten et al., 2008). To globally 
assess which subsets of the fast muscle program are regulated by 
both Pbx2/4 and Prdmla versus those that are regulated by either 
Pbx or Prdmla, we used high-throughput sequencing of mRNA 
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Fig. 3. Adaxial cell expression of mylpfa is absent in pbx2/4-MO embryos 
but present in pbx2/4-MO;prdmla—/— embryos. RNA in situ expression of 
mylpfa (green) and smyhcl (red) in (A) control, (B) pbx2/4-MO, 
(C) prdmla—/—, and (D) pbx2/4-MO,prdm 1 a— /— embryos. Embryos are 14 
somites stage, shown in dorsal view, anterior towards the left. Somite numbers 
2-7 are shown for each embryo. Yellow lines indicate where optical sections 
are taken through the x and y planes (lower and right panels, respectively, each 
image). Dorsal-ventral (D-V) axes (z axis in xy view) are indicated. 
Arrowheads point to lateral domain of mylpfa expression, absent or reduced in 
pbx2/4-MO and pbx2/4-MO,prdm 1 a— /— (asterisk) embryos, smyhcl- 
overlapping mylpfa expression in adaxial cells is present in pbx2/4- 
MO,prdmla—/— embryos. Scale bar: 50 p:m. 

libraries (RNA-seq). We generated libraries for control prdmla+/ 
+ embryos, pbx2/4-MO embryos, prdmla—/— embryos and 
pbx2/4-MO;prdmla—/— embryos in triplicate at the 10 somite 
(s) stage (see Materials and Methods). We chose this stage 
because it is when early skeletal muscle differentiation initiates in 
zebrafish and we wanted to assess the earliest muscle 
requirements for Pbx2/4 and Prdmla. We also chose this stage 
in order to compare with previous expression array analyses that 
we performed on 10 s control and pbx2/4-MO embryos (Maves et 
al., 2007; Maves et al., 2009). Because we could not reliably sort 
live prdmla mutant embryos from their wild-type siblings at 
10 s, we developed a technique to genotype RNA samples from 
individual embryos (see Materials and Methods). We pooled 
equivalent-genotype RNA samples for our RNA-seq libraries 
(see Materials and Methods). 

As an initial assessment of the RNA-seq data, we examined 
results from pbx2/4-MO and prdmla—/— , each relative to 
control. Using the cut-offs of > 1.5-fold change and 
significance of P<0.05, we identified 637 genes that are 
downregulated in pbx2/4-MO embryos compared to controls 
(supplementary material Table SI). This group of 637 genes 
includes the Pbx targets mylpfa, myog, and krox-20 and includes 
72/1 1 1 (65%) of the genes that we had previously identified with 
expression arrays as downregulated in pbx2/4-MO 10 s embryos, 
including 10/12 genes that we had validated by in situ 
hybridization (supplementary material Table SI) (Maves et al., 
2007). Additionally, this group of 637 genes includes 6 genes that 
we previously identified as downregulated in pbx2/4-MO 
embryos with expression arrays at 18 s, but not with arrays at 



10 s, including mylpfa (supplementary material Table SI) 
(Maves et al., 2007). Thus, although it is not clear why our 
RNA-seq analysis did not identify all of the genes previously 
identified with the expression arrays at 10 s, overall it appears 
that the RNA-seq analysis is more sensitive than our previous 
expression array analysis, because we identified genes at 10 s 
from RNA-seq that the expression arrays only picked up at 18 s. 
We also identified 142 genes that are upregulated in prdmla—/— 
embryos compared to controls (supplementary material Table 
S2). This group of 142 genes includes the Prdmla direct targets 
mylpfa and myhz2 and known prdmla -regulated genes agr2, 
myll, and myhzl.3 (supplementary material Table S2) (von 
Hofsten et al., 2008; Hernandez-Lagunas et al., 2005; Burguiere 
et al., 2011). Altogether, our initial assessment shows that our 
RNA-seq analysis has identified Pbx2/4- and Prdml -dependent 
genes. 

RNA-seq analysis identifies subsets of the fast muscle program 
that are differentially regulated by Pbx2/4 and Prdmla 
Because Pbx2/4 and Prdmla have both been shown to broadly 
regulate the fast muscle gene expression program (Maves et al., 
2007; von Hofsten et al., 2008), we used the RNA-seq data to ask 
if Pbx2/4 and Prdmla each regulate completely overlapping sets 
of fast muscle genes. We compared the genes downregulated in 
pbx2/4-MO embryos (supplementary material Table SI) with the 
genes upregulated in prdmla—/— embryos (supplementary 
material Table S2). At this early muscle differentiation stage, 
we find 12 genes overlapping between these two groups 
(Fig. 5A,B). The overlapping genes include fast muscle genes 
atp2al, myll and mylpfa (Fig. 5B). We then examined the gene 
lists from supplementary material Tables SI and S2 and found 
additional fast muscle genes in either the downregulated-in-/?fa2/ 
4-MO set or the upregulated-in-/>rfi?m7a— /— set (Fig. 5B). These 
results show that pbx2/4 and prdmla regulate overlapping as well 
as distinct sets of fast muscle differentiation genes. 

We next examined the RNA-seq results for the target genes 
analyzed in Fig. 2, using the Integrative Genome Viewer 
(Robinson et al., 2011) to visualize the data. As expected from 
our previous in situs and qRT-PCR, mylpfa and myog are 
downregulated in pbx2/4-MO embryos, upregulated in 
prdmla—/— embryos, and remain upregulated in pbx2/4- 
MO;prdmla—/— embryos (Fig. 6A,B). For myhzl.3, expression 
is not detected in control and pbx2/4-MO embryos but is 
upregulated in prdmla—/— and pbx2/4-MO;prdmla—/— 
embryos (Fig. 6C; see also Discussion), krox-20 and smyhcl 
show expected expression patterns (Fig. 6D,E). These results 
further validate our RNA-seq analysis. 

To globally determine whether prdmla functions epistatically 
to pbx2/4 broadly across the fast-muscle gene expression 
program, or only at subsets of fast muscle genes, we compared 
the data sets for genes downregulated in pbx2/4-MO embryos 
with three different data sets from pbx2/4-MO;prdmla—/— 
embryos: 1, genes upregulated relative to control; 2, genes 
expressed at about control levels; and 3, genes downregulated 
relative to control. (Fig. 7A-C). The overlap of Set 1 contains 
only four genes, but two of these are fast muscle myosin genes, 
myll and mylpfa (Fig. 7A; supplementary material Table S3). 
The overlap of Set 2 contains fast muscle differentiation genes, 
including atp2al, a fast muscle SERCA Ca(2+)-ATPase (Wu et 
al., 1995), ryrlb (Hirata et al., 2007), and also myog (Fig. 7B; 
supplementary material Table S3). These analyses reveal that 
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Fig. 4. Pbx2/4 and Prdmla independently regulate genes. Graphs of qRT-PCR expression levels for (A,B) krox-20 and (C,D) smyhcl. Expression levels were 
normalized to odd expression. Error bars represent standard deviation. Somite stages (10 s, 14 s) are shown. 



many fast muscle genes that are Pbx-dependent are indeed 
"rescued", relative to pbx2/4-MO levels, or upregulated in 
pbx2/4-MO;prdmla—/— embryos. However, the overlap of Set 
3 also contains fast muscle genes, such as atpla2a, myomla 
and srl (Fig. 7C; supplementary material Table S3). Thus, 
certain fast muscle genes that are Pbx-dependent are still 
downregulated in pbx2/4-MO;prdmla—/— embryos. Taken 
together, our RNA-seq analysis reveals subsets of the fast 



muscle differentiation program that are differentially regulated 
by Pbx2/4 and Prdmla. 

Validation of fast muscle genes differentially regulated by Pbx 
and Prdmla 

We next validated genes from each of these three sets of loci that 
are differentially regulated by Pbx and Prdmla. For Set 1, we 
already demonstrated that mylpfa is upregulated in pbx2/4- 
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Fig. 5. pbx2/4 and prdmla regulate overlapping and distinct subsets of fast muscle differentiation genes, identified through RNA-seq. (A) Venn diagram 
identifying the intersection (12 genes) of the set of genes downregulated in pbx2/4-MO embryos (pbx2/4-MO<contml, supplementary material Table SI) with the set 
of genes upregulated in prdmla—/— embryos {prdmla— I— >control, supplementary material Table S2). (B) List of the 12 genes identified in A. Also shown are fast 
muscle genes regulated by Pbx2/4, independently of Prdmla, and those regulated by Prdmla, independently of Pbx2/4, as identified from supplementary material 
Tables S 1 and S2. The asterisk refers to fast myosin heavy chain genes that were not identified as Pbx2/4-dependent through RNA-seq, but that are known through 
additional validation to be Pbx2/4-dependent (see Fig. 2 and Discussion). 
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Fig. 6. Visualization and validation of 
RNA-seq analysis. Mapping of RNA- 
seq reads as viewed in the Integrative 
Genomics Viewer for (A) mylpfa, 
(B) myog, (C) myhzl.3, (D) krox-20, and 
(E) smyhcl. Mapped reads are 
represented by grey histograms. Blue 
bars at the bottom of each panel 
represent annotated exons and UTRs 
(blue boxes) and introns (blue lines with 
arrowheads for orientation of transcript 
from 5' to 3') for each gene. For C, 
myhzl.3 is not annotated but the 
genomic position is confirmed relative 
to neighboring annotated zebrafish 
myosin heavy chain genes and 
mammalian orthologues. For each gene, 
rows for reads from control, pbx2/4-MO, 
prdmla—/— , and pbx2/4- 
MO;prdmla—/— are shown, as in A. 



MO;prdmla—/— embryos (Fig. 2), and we confirmed by both in 
situ and qRT-PCR that myll is downregulated in pbx2/4-MO 
embryos, upregulated in prdmla—/— embryos, and upregulated 
in pbx2/4-MO;prdmla—/— embryos (Fig. 8A,B). For Set 2, we 
already demonstrated that myog is expressed at about control 
levels in pbx2/4-MO;prdmla— /— embryos (Fig. 2), and we 
confirmed that atp2al is downregulated in pbx2/4-MO embryos, 
upregulated in prdmla—/— embryos, and expressed at about 
control levels in pbx2/4-MO;prdmla— /— embryos (Fig. 8C,D). 
Set 3 contains many neural genes, such as krox-20, that are 
known to be Pbx-dependent (Waskiewicz et al., 2002; Maves 
et al., 2007; Choe et al., 2009), and we already confirmed 
that krox-20 is strongly downregulated in pbx2/4-MO and 
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Fig. 7. RNA-seq analysis identifies subsets of the fast muscle program that 
are differentially regulated by Pbx2/4 and Prdmla. (A-C) Venn diagrams 
showing overlap of genes whose expression is downregulated in pbx2/4-MO 
embryos relative to control (green circles) compared to (A) genes that are 
upregulated in pbx2/4-MO;prdml —/— embryos, (B) genes that are expressed at 
about control levels in pbx2/4-MO;prdm 1 —/— embryos, and (C) genes that are 
downregulated in pbx2/4-MO;prdml —/— embryos. The number of genes in 
each overlap is shown in parentheses; genes are listed in supplementary 
material Table S3. 



pbx2/4-MO;prdmla— /— embryos but not affected in 
prdmla—/— embryos (Fig. 2, Fig. 4). We have previously 
shown that atpla2a is downregulated in pbx2/4-MO embryos 
(Maves et al., 2007). We also validated that the fast muscle gene 
srl is downregulated in pbx2/4-MO embryos, not affected in 
prdmla—/— embryos, and remains downregulated in pbx2/4- 
MO;prdmla—/— embryos (Fig. 8D,E). This analysis 
demonstrates that certain fast muscle genes are Pbx-dependent 
but are not repressed by Prdmla, confirming that subsets of the 
fast muscle program have different requirements for Pbx and 
Prdmla. 

Discussion 

The coordinated expression of fiber-type-specific gene 
expression programs is necessary to give each fiber type its 
unique contractile properties. Pbx and Prdmla proteins have 
previously been shown to have opposite requirements for fast 
muscle gene expression in zebrafish (Maves et al., 2007; 
Baxendale et al., 2004; von Hofsten et al., 2008; Liew et al., 
2008). Here we combined loss of function of both Pbx and 
Prdmla to test the requirements for Pbx in promoting fast muscle 
differentiation. We used RNA-seq to identify subsets of the fast- 
muscle program that have different requirements for Pbx and 
Prdmla. Our results reveal that there are multiple combinations 
of Pbx and Prdmla inputs that regulate fast muscle gene 
expression (Fig. 9). In lateral cells, fast muscle differentiation 
genes are positively regulated by Pbx and Myod, likely acting 
together, as has been shown for mylpfa (Maves et al., 2007) 
(Fig. 9). In adaxial cells, myll is repressed by Prdmla but not 
regulated by Pbx, srl is positively regulated by Pbx and Myod but 
not regulated by Prdmla, and mylpfa expression appears to be 
regulated through a balance of Pbx/Myod positive input and 
Prdmla repression (Fig. 9 and references therein). Our results 
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Fig. 8. Validation of fast muscle genes differentially regulated by Pbx and Prdmla. (A,C,E) RNA in situ expression of (A) myll and krox-20, (C) atp2al and 
krox-20, and (E) srl and krox-20 in green, smyhcl in red, and merged images in control, pbx2/4-MO, prdmla—/—, and pbx2/4-MO;prdmla—/— embryos, krox-20 
expression is included as a control for the pbx2/4-MO knockdown (Maves et al., 2007). r3 and r5 indicate krox-20 expression in hindbrain rhombomeres. smyhcl is 
downregulated in prdmla—/— embryos (Elworthy et al., 2008; von Hofsten et al., 2008). (A,C) embryos are at 14 somites, (E) embryos are at 18 somites. All 
embryos are shown in dorsal view, anterior towards the left. Somites were counted in all embryos shown to confirm consistent staging. (B,D,F) Graphs of real-time 
RT-PCR (qRT-PCR) expression levels for (B) myll, (D) atp2al, and (F) srl. Expression levels were normalized to odd expression. Error bars represent standard 
deviation. Somite stage for qRT-PCR (14 s) is shown. Scale bar, 50 u.m. 



thus show that Pbx and Prdmla transcription factors differentially 
regulate subsets of the fast skeletal muscle differentiation 
program in zebrafish. 

Our results support the hypothesis that pioneer factors may 
function through different mechanisms depending on the target 
gene (Zaret and Carroll, 2011; Budry et al., 2012). For example, 
pioneer factors can play passive roles by reducing the number of 
additional transcription factor binding events needed for gene 
activation, or more active roles by opening up local chromatin to 
allow other factors to bind (Zaret and Carroll, 2011). Our 
findings suggest that, for a subset of fast muscle genes such as 
mylpfa, Pbx proteins are needed to repress the repressive activity 
of Prdmla (Fig. 9B), possibly by inhibiting Prdmla binding or 
activity at fast muscle gene promoters. Pbx proteins have been 
shown to inhibit gene expression through interactions with 
transcriptional repressors (Saleh et al., 2000; Berghella et al., 



2008; Choe et al., 2009), but activating gene expression by 
repressing a transcriptional repressor may entail a new 
mechanism for Pbx activity. Pbx proteins could take an active 
role in inhibiting Prdmla, while promoting gene expression, by 
modifying chromatin states at fast muscle gene promoters 
through Pbx interactions with chromatin factors such as Brgl 
or CBP (de la Serna et al., 2005; Choe et al., 2009). Other fast 
muscle genes, such as srl, are Prdm la-independent but require 
Pbx for their activation. Further studies will be needed to 
understand the mechanisms by which Pbx proteins regulate 
different subsets of the fast muscle program, in particular how 
Pbx proteins repress the activity of Prdmla. 

We used an RNA-seq approach in order to globally assess 
which subsets of the fast muscle program are co-regulated by 
Pbx2/4 and Prdmla versus those that are independently 
regulated. In general, we found that our RNA-seq approach 
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Fig. 9. Multiple combinations of Pbx and Prdmla inputs regulate fast muscle gene expression. Cartoons illustrating inputs for the expression of three fast muscle 
genes (A) myll, (B) mylpfa, and (C) srl. Evidence for inputs comes from A (Burguiere et al., 201 1; this work), B (Maves et al., 2007; von Hofsten et al., 2008; Liew 
et al., 2008; this work), and C (Maves et al., 2007; this work). Inputs reflect genetic requirements, although, in the case of mylpfa, direct regulation by Pbx, 
Myod and Prdmla has been demonstrated (Maves et al., 2007; von Hofsten et al., 2008; Liew et al., 2008). 
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Table 1. Primers used. 



Primer name 


Sequence 5 '-3' 


prdmladcapsF2 


TTGAGTGACTTTATAAGAGC 


prdmladcapsR2 


GCTCCTGCTG- 




GGAATGCGCTGGATTCACAGAACGCAT 


odcL2 


GACTTTGACTTCGCCTTCCT 


odcR2 


GAGGTGCTTCTTCAGGACATC 


mylpfaintFl 


CATACCGTCTCGACATGGCACCCA 


mylpfaintRl 


CCTCCTGCTGCCCTCCTCTTGG 


myhzl .3QF1 


GGAGGCTGAGGAGCGTGCTGATATAG 


myhzl.SQRl 


GTCACAGTTGTTGAACGTCGTTTACTC 


myogLl 


CTGGGGTGTCGTCCTCTAGT 


myogRl 


TCGTCGTTCAGCAGATCCT 


krox-20newFl 


CGGTGGACGAGCTTGCCACA 


krox-20newRl 


CCGTTAATCAGGCCATCTCCTGCG 


smyhclnewF3 


CCAAGCTAACCCTGCTCTCGAGG 


smyhclnewRi 


TGCTGAGGCCAGTTTTCCACTCAC 


myllF2 


GTGATGGGTGCTGAGCTGCGT 


myllR2 


GCCGTTTTCATCCTCCTGGCCT 


atp2alF2 


GCTTTTGGCCGCCTGCATCTCT 


atp2alR2 


GCTCGACGAAGGCAGTGACTGTT 


srlRTFl 


AACGAGCTTCGACAACACGA 


srlRTRl 


ATCATGGAGGACTTGCCCAC 



was more sensitive in identifying differentially expressed genes 
than our previous expression array approach with pbx2/4-MO 
embryos (Maves et al., 2007). Our analysis at the 10-somite stage 
allowed us to assess the earliest muscle differentiation 
requirements for both Pbx2/4 and Prdmla, but we were not 
able to assess the entire fast muscle program because many fast 
muscle genes, such as tnnt, are not activated until later 
developmental stages (Xu et al., 2000). Some myosin heavy 
chain genes are expressed early, but we found the fast myosin 
heavy chain genes to be difficult to analyze by RNA-seq because 
the sequences of the six genes are so similar that sequencing 
reads may not get assigned properly. For example, based on our 
in situ and qRT-PCR analyses, myhzl. 3 should have been 
identified in Fig. 5 as Pbx-dependent and in Fig. 7 in Class 1 
(strong expression levels in pbx2/4-MO;prdmla— /— embryos), 
but myhzl. 3 RNA-seq expression levels were so low in control 
embryos (Fig. 6C) that myhzl. 3 was not identified in the Pbx- 
dependent set. Another gene that was not identified by our RNA- 
seq approach is sox6, a known Prdm la-dependent gene (von 
Hofsten et al., 2008). However, even though sox6 is expressed 
within fast muscle precursors and is regulated by Prdmla and 
Myod (von Hofsten et al., 2008; Wang et al., 201 1), sox6 does not 
appear to be regulated by Pbx2/4 (L.M., unpublished 
observations). Even though these examples are caveats to our 
RNA-seq analysis, they help provide further evidence that Pbx2/4 
and Prdmla regulate shared as well as independent aspects of the 
fast muscle program. 

Why is a balance of factors needed for fiber-type 
differentiation? Like zebrafish slow muscle precursor cells, 
embryonic mammalian and avian myocytes and myofibers 
initially co-express both slow- and fast-type isoforms of many 
muscle differentiation genes (Narusawa et al., 1987; Sweeney et 
al., 1989; Lyons et al., 1990; Condon et al., 1990; Sacks et al., 
2003; Niro et al., 2010). How this mixed slow/fast phenotype 
resolves into slow-twitch and multiple types of fast-twitch fibers 
is not fully understood. Perhaps a balance of factors that activate 
and repress subsets of fiber-type differentiation programs allows 
for increased plasticity in generating a variety of fiber types. 



One outstanding question is whether Pbx and Prdml proteins 
are required for mammalian muscle differentiation. Mouse cell 
culture studies support a pioneer factor role for Pbx proteins in 
mouse skeletal muscle differentiation (Berkes et al., 2004; de la 
Serna et al., 2005), but direct requirements for Pbx proteins in 
mammalian skeletal muscle have not yet been tested. A recent 
analysis of the mouse Prdml conditional knock-out in embryonic 
myogenic cells or myogenic precursor cells revealed normal 
onset of myogenesis and normal expression of fast and slow 
muscle differentiation genes in the absence of Prdml (Vincent et 
al., 2012). This study even addressed functional compensation by 
another Prdml -like factor by showing that expression of a 
dominant-negative Prdml had no effect on early mouse fast or 
slow skeletal muscle development (Vincent et al., 2012). If 
Prdml factors are not required in early mammalian skeletal 
muscle differentiation, it remains a possibility that a different 
factor with similar activity could act in a balance with Pbx 
proteins. In future studies it will be important to determine 
whether Pbx proteins and a Prdm or Prdm-like factor function to 
regulate mammalian skeletal muscle fiber-type differentiation. 

The coordinated expression of contractile gene expression 
programs is critical not only for skeletal muscle differentiation 
but also for myocardial and smooth muscle differentiation. Pbx 
proteins are expressed in myocardial and vascular smooth muscle 
cells and are needed for proper development of the heart and 
cardiac outflow tract (Chang et al., 2008; Stankunas et al., 2008; 
Maves et al., 2009). It is possible that Pbx proteins will act in a 
balance with other transcription factors to promote subsets of 
myocardial and smooth muscle differentiation programs. 

Materials and Methods 

Zebrafish husbandry 

All experiments involving live zebrafish (Danio rerio) were carried out in 
compliance with IACUC guidelines. Zebrafish were raised and staged as 
previously described (Westerfield, 2000). Time (hpf) refers to hours post- 
fertilization at 28.5 °C. In some cases, embryos were raised for periods at room 
temperature, about 25 °C. The wild-type stock used was AB. The prdmla'"** 05 
mutant line has been described and is likely a null allele (Hernandez-Lagunas et 
al., 2005). prdmla"' H05 genotyping primers, prdmladcapsF2 and prdmladcapsR2 
are in Table 1. The genotyping PCR product is 238 base pairs. Digesting with 
XmnI yields a 204 base pair product from the mutant allele. 

Morpholino and mRNA injections 

Morpholino injections were performed, and working concentrations were 
determined, as previously described (Maves et al., 2007). Morpholinos were 
used at the following working concentrations: pbx2-M02, CCGTTGCCTGTGAT- 
GGGCTGCTGCG, 0.25 mg/ml; pbx2-MC)l, GCTGCAACATCCTGAGCACT- 
ACATT, 0.5 mg/ml; pbx2-M03 5-base mis-match control, GCTcCAAgATCC- 
TcAcCAgTACATT, 0.5 mg/ml; pbx4-MO\, AATACTTTTGAGCCGAATC- 
TCTCCG, 0.5 mg/ml; pbx4-MQ2, CGCCGCAAACCAATGAAAGCGTGTT, 
0.5 mg/ml; Standard control-MO, CCTCTTACCTCAGTTACAATTTATA, 
1 mg/ml. pbx2-M02, pbx2-M03, pbx4-MOl and pbx4-M02 were combined to 
make the pbx2/4-MQ combination (3 nl injected). Western analysis confirmed loss 
of Pbx2 and Pbx4 expression in pbx2/4-MO-treated embryos (Maves et al., 2007). 
Embryos that are knocked-down for Pbx2 and Pbx4 function show a 
developmental delay of about 2 somites during somitogenesis stages (Maves et 
al., 2007). We somite stage-matched sibling control and MO-treated embryos 
when collecting embryos for staining or for RNA analyses. 

RNA in situ hybridization and immunocytochemistry 

RNA in situ hybridizations were performed as previously described (Maves et al., 
2007; Talbot et al., 2010). The following cDNA probes were used: prdmla 
(Hernandez-Lagunas et al., 2005); krox-20 (egr2/?-Zebrafish Information Network) 
(Bradford et al., 2011; Oxtoby and Jowett, 1993); mylpfa (formerly mylz2) (Xu et 
al., 2000); myhzl. 3 (formerly finyhcX) (von Hofsten et al., 2008); smyhcl (Bryson- 
Richardson et al., 2005); myog (Weinberg et al., 1996); atp2al (Maves et al., 
2007); srl (Maves et al., 2007). Following staining, embryo heads were genotyped 
for prdmla as above. 
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Whole-embryo immunostaining was performed with anti-pan zebraflsh Pbx, 
1 :500 (Popper! et al, 2000) as previously described (Maves et al., 2007). For DAPI 
staining, DAPI was included in the secondary antibody incubation step at a dilution 
of 1:2000. 

Embryos were photographed using a Zeiss Stemi SV6 and Nikon CoolPix 4500 
digital camera or imaged using a Zeiss Pascal confocal microscope or Leica TCS 
SP5 confocal microscope. Images were assembled using Adobe Photoshop. 

RNA genotyping of single embryos and quantitative (q) RT-PCR 

Embryos were collected from prdmla+/— parents and injected with either control 
MO (pbx2-M03 mismatch or Standard Control MO) or pbx2/4-MO combination. 
Total RNA from individual embryos was isolated using the RNAqueous Micro Kit 
(Ambion). RNA samples were genotyped for prdmla from residual DNA. We then 
pooled RNA samples from about 10 equivalent-genotype embryos for four classes: 
control-MO;prdml +/+; control-MO\prdml —I— ; pbx2/4-MO;prdml +/+; and 
pbx2/4-MO;prdml—/—. These pooled samples were then further cleaned and 
concentrated using the RNeasy MinElute Kit (Qiagen). 100 ng of pooled RNA 
samples plus random hexamers were used in a reverse-transcriptase reaction with 
SuperScriptll Reverse Transcriptase (Invitrogen). Real-time PCRs were performed 
using SYBR Green (Bio-Rad) and an Applied Biosystems 7900HT System 
according to the manufacturer's instructions. The relative expression levels were 
normalized to those of ornithine decarboxylase (odd) (Maves et al., 2007). For 
primers, see Table 1. 

RNA-seq library preparation and sequencing 

Total RNA samples were genotyped and pooled as above for control-MO;prdml+/ 
+; controI-MO;prdml -/-; pbx2/4-MO;prdml+/+; and pbx2/4-MO;prdml-/- 
embryos at the 10 somite (s) stage from three independent sets of egg collections/ 
injections. An additional set of embryos from each set of injections were processed 
for in situ hybridization to confirm somite staging as well as loss of krox-20 
expression in pbx2/4-MO embryos (see Fig. 2A for example). Library preparation 
and sequencing was carried out by the FHCRC Genomics Shared Resource. 
Sequencing libraries were prepared from total RNA using the TruSeq RNA Sample 
Prep Kit (Illumina) according to the manufacturer's instructions. Library size 
distributions were validated using an Agilent 2100 Bioanalyzer. Additional library 
quality control, blending of pooled indexed libraries, and cluster optimization were 
performed using the QPCR NGS Library Quantization Kit (Agilent Technologies). 
RNA-seq libraries were pooled (4-plex) and clustered onto a flow cell lane using 
an Illumina cBot. Sequencing occurred on an Illumina HiSeq 2000 using a single- 
read, 100 base read length (SRI 00) sequencing strategy. 

RNA-seq data processing and analysis 

Image analysis and base calling were performed with Illumina's Real Time 
Analysis vl.12 software. Files were demultiplexed of indexed reads and generated 
in FASTQ format using Illumina's CASAVA vl.8. software. Reads were removed 
that did not pass Illumina's base call quality threshold. Reads were aligned to 
zebraflsh genome build Zv9 release 63, using TopHat 1.4 (Trapnell et al., 2009). 
SAMtools v0.1.18 (Li et al., 2009) was used to sort and index the TopHat 
alignments such that they could be visualized in the Integrative Genomics Viewer 
(Robinson et al., 2011). The gene expression profiles of control -MQ;prdml+/+ 
versus control-MO;prdml—/—; pbx2/4-MO;prdml+/+; and pbx2/4- 
MO;prdml —/— embryos were compared using the Bioconductor package edgeR 
(Robinson et al., 2010). To include reads that mapped to multiple homologs in the 
zebraflsh genome, including the myosin heavy chain genes, reads with multiple 
hits on the genome were kept, and each hit received a fractional count equal to one 
over the number of hits. Venn diagrams were generated using BioVenn (Hulsen et 
al., 2008). Data have been deposited in NCBI's Gene Expression Omnibus and are 
accessible through GEO Series accession number GSE45532. 
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